首页> 外文OA文献 >GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping Using Processing-in-Memory Technologies
【2h】

GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping Using Processing-in-Memory Technologies

机译:GRIm-Filter:DNa读取映射中的快速种子位置过滤   处理内存技术

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Motivation: Seed location filtering is critical in DNA read mapping, aprocess where billions of DNA fragments (reads) sampled from a donor are mappedonto a reference genome to identify genomic variants of the donor.State-of-the-art read mappers 1) quickly generate possible mapping locationsfor seeds (i.e., smaller segments) within each read, 2) extract referencesequences at each of the mapping locations, and 3) check similarity betweeneach read and its associated reference sequences with acomputationally-expensive algorithm (i.e., sequence alignment) to determine theorigin of the read. A seed location filter comes into play before alignment,discarding seed locations that alignment would deem a poor match. The idealseed location filter would discard all poor match locations prior to alignmentsuch that there is no wasted computation on unnecessary alignments. Results: We propose a novel seed location filtering algorithm, GRIM-Filter,optimized to exploit 3D-stacked memory systems that integrate computationwithin a logic layer stacked under memory layers, to performprocessing-in-memory (PIM). GRIM-Filter quickly filters seed locations by 1)introducing a new representation of coarse-grained segments of the referencegenome, and 2) using massively-parallel in-memory operations to identify readpresence within each coarse-grained segment. Our evaluations show that for asequence alignment error tolerance of 0.05, GRIM-Filter 1) reduces the falsenegative rate of filtering by 5.59x--6.41x, and 2) provides an end-to-end readmapper speedup of 1.81x--3.65x, compared to a state-of-the-art read mapperemploying the best previous seed location filtering algorithm. Availability: The code is available online at:https://github.com/CMU-SAFARI/GRIM
机译:动机:种子位置过滤对于DNA读取作图至关重要,该过程中将从供体中取样的数十亿个DNA片段(读物)作图到参考基因组上,以鉴定供体的基因组变异。最新的阅读作图仪1)快速为每个读取中的种子(例如,较小的片段)生成可能的定位位置; 2)在每个定位位置提取参考序列; 3)使用计算昂贵的算法(即序列比对)检查每个读取及其相关参考序列之间的相似性,以达到确定读取的原点。种子位置过滤器在对齐之前开始工作,丢弃那些认为对齐不佳的种子位置。理想种子位置过滤器将在对齐之前丢弃所有较差的匹配位置,从而不会在不必要的对齐上浪费计算。结果:我们提出了一种新颖的种子位置过滤算法GRIM-Filter,该算法经过优化,可以利用3D堆栈存储系统将计算与存储层下面的逻辑层集成在一起,以执行内存中处理(PIM)。 GRIM-Filter通过1)引入参考基因组的粗粒度片段的新表示形式和2)使用大规模并行的内存中操作来识别每个粗粒度片段中的存在状态来快速过滤种子位置。我们的评估表明,对于0.05的序列对齐误差容忍度,GRIM滤波器1)将过滤的误报率降低了5.59x--6.41x,并且2)提供了1.81x--3.65x的端到端读取映射器加速,与采用最佳的先前种子位置过滤算法的最新读取映射相比。可用性:该代码可从以下位置在线获得:https://github.com/CMU-SAFARI/GRIM

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号